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Abstract 

Spatial interaction between two or more classes or species has important implications in various fields 
and causes multivariate patterns such as segregation or association. Segregation occurs when members 
of a class or species are more likely to be found near members of the same class or conspecifics; while 
association occurs when members of a class or species are more likely to be found near members of another 
class or species. The null patterns considered are random labeling (RL) and complete spatial randomness 
(CSR) of points from two or more classes, which is called CSR independence, henceforth. The clustering 
tests based on nearest neighbor contingency tables (NNCTs) that are in use in literature are two-sided 
tests. In this article, we consider the directional (i.e., one-sided) versions of the cell-specific NNCT-tests 
and introduce new directional NNCT-tests for the two-class case. We analyze the distributional properties; 
compare the empirical significant levels and empirical power estimates of the tests using extensive Monte 
Carlo simulations. We demonstrate that the new directional tests have comparable performance with the 
currently available NNCT-tests in terms of empirical size and power. We use four example data sets for 
illustrative purposes and provide guidelines for using these NNCT-tests. 
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1 Introduction 



Spatial point patterns have important implications in epidemiology, population biology, ecology, and other 
fields, and have been extensively studied. Most of the research on spatial patterns from the early days on 
pertains to patterns of one type of points; i.e., to spatial pattern of a type of points with respect to the ground 
(e.g., density, dumpiness, etc.). These patterns for only one type of points usually fall under the pattern 
category called spatial aggregation (Coomes et al. (1999)), clustering, or regularity. However, it is also of 
practical interest to investigate the spatial interaction of one type of points with other types (Pielou (1961)). 
The spatial relationships among two or more types of points have interesting consequences, especially for 
plant species. See, for example, Pielou (1961), Pacala (1986), and Dixon (1994, 2002a). For convenience and 
generality, we refer to the different types of points as "classes" , but class can stand for any characteristic of 
an individual at a particular location. For example, the spatial segregation pattern has been investigated for 
species (Pielou (1961), Whipple (1980), and Diggle (2003)), age classes of plant species (Hamill and Wright 
(1986)), fish species (Herler and Patzner (2005)), and sexes of dioecious plants (Nanami et al. (1999)). Many 
of the epidemiologic applications are for a two-class system of case and control labels (Waller and Gotway 
(2004)). 

Many univariate and multivariate (i.e., one-class and multi-class) tests have been proposed for testing 
segregation of two classes in statistical and other literature (KuUdorff (2006)). These include comparison 
of Ripley's K{t) functions (Diggle and Chetwynd (1991)), comparison of NN distances (Diggle (2003)), and 
NNCTs (Pielou (1961) and Dixon (1994)). Pielou (1961) proposed various tests based on NNCTs for the 
two-class case only and Dixon (1994) introduced an overall test of segregation and class-specific tests based 
on NNCTs for the two-class case and extended his tests to multi-class case (Dixon (2002a)). For the two- 
class case, Ceyhan (2008b) discussed these tests and demonstrated that Pielou's test is liberal under GSR 
independence or RL and is only appropriate for a random sample of (base, NN) pairs. If ?; is a NN of 
point u, then u is called the base point and v is called the AW point. He also suggested the use of Fisher's 
exact test for NNGTs and evaluated its variants and the exact version of Pearson's test in (Geyhan (2006)). 
Furthermore, Geyhan (2008c) proposed new cell-specific and overall segregation tests which are more robust 
to the differences in the relative abundance of classes and have better performance in terms of size and power. 

In literature, most segregation tests are two-sided tests for the two-class case or against a general alternative 
for the multi-class case. In particular, the NNGT-tests in literature are not directional tests. In this article, we 
discuss the directional (i.e., one-sided) versions of the cell-specific tests of Dixon (1994, 2002a) and Ceyhan 
(2008c) and propose new directional segregation tests. We compare these tests in terms of distributional 
properties, and empirical size and power through extensive Monte Carlo simulations. We also compare these 
tests with Ripley's K or L-functions (Ripley (2004)) and pair correlation function g{t) (Stoyan and Stoyan 
(1994)), which are methods for second-order analysis of point patterns. We only consider completely mapped 
data; i.e., for our data sets, the locations of all events in a defined area are observed. We show through 
simulation that the newly proposed directional tests perform similar (only slightly better in power) to the 
cell-specific tests of Ceyhan (2008c) but perform better (in terms of empirical size and power) than Dixon's 
cell-specific tests. Furthermore, we demonstrate that our tests and Ripley's L-function and related methods 
(i.e., second-order analysis) answer different questions about the pattern of interest. 

"Dependence" in this article refers to the dependence in the probabilistic sense between the cell counts 
which results from the spatial dependence between the points from spatial point patterns. Spatial dependence 
between points in a particular pattern is a well known phenomenon and has been extensively studied. In 
mathematical statistics, "spatial dependence" is used as a measure of the degree of spatial interaction between 
independently measured observations from a temporally or spatially ordered set of points. The cause of spatial 
dependence is not the sample selection, nor sample preparation, nor the measurement order (Hald (1952)). 
However, in practice, the spatial data show dependence due to the order they occupy in space and time. 
Modeling such dependence usually causes problems in the statistical analysis because time is unidirectional, 
and space is omnidirectional (see, for example, Pace et al. (2000)). Nearest neighbor spatial dependence is a 
consequence of this spatial dependence (Pace and Zhou (2000)). Just as the spatial patterns have been mostly 
analyzed for one-class patterns, spatial dependence is usually investigated for one class only. For example, 
indices of dependence (for clustering or regularity) are proposed by van Lieshout and Baddeley (1996) and 
are extended for multi-type point patterns in (van Lieshout and Baddeley (1999)). In the latter article, the 
authors introduce a dependence index whose values, if larger than 1, indicate inhibition (similar to what we 
call segregation in this article), and if smaller than 1, indicate positive association (similar to what we call 



association in this article) and equals to 1 for Poisson point patterns. Hence, segregation and association can 
be viewed as two opposite types of spatial dependence between two or more classes. 

For simplicity, we discuss the spatial clustering patterns between two classes only; the extension to the case 
with more classes is straightforward for the cell-specific tests. We discuss the null and alternative patterns 
in Section 2, describe the construction of the NNCTs in Section 3, discuss Dixon's and Ccyhan's cell-specific 
segregation tests in Sections 4.1 and 4.2, and introduce directional version of Piclou's test of segregation in 
Section 4.3, and new directional tests in Section 4.4. We provide the empirical significance level analysis under 
CSR independence in Section 5, empirical power analysis under segregation and association alternatives in 
Section 6, and illustrate the tests in example data sets in Section 7, provide discussion and guidelines for 
using the tests in Section 8. 

2 Null and Alternative Patterns 

The null hypothesis in the univariate spatial point pattern analysis is usually complete spatial randomness 
(CSR) (Diggle (2003)). There are two benchmark hypotheses to investigate the spatial interaction between 
multiple classes in a multivariate process: (i) independence, which implies the classes of points are generated 
by independent univariate processes and (ii) random labeling (RL), which implies that the class labels are 
randomly assigned to a given set of locations in the region of interest (Diggle (2003)). In this article, our null 
hypothesis is 

Ho : randomness in the NN structure 

which might result from two random pattern types: CSR of points from two classes (this pattern will be 
called the CSR independence, henceforth) or RL. In the CSR independence pattern, points from each of the 
two classes independently satisfy the CSR pattern in the region of interest. 

Although CSR independence and RL are not same, they lead to the same null model for NNCT-tcsts, since 
a NNCT does not require spatially-explicit information. That is, when the points from two classes arc assumed 
to be independently uniformly distributed over the region of interest, i.e., under the CSR independence 
pattern, or when only the labeling (or marking) of a set of fixed points (where the allocation of the points 
might be regular, aggregated, or clustered, or of lattice type) is considered, i.e., under the RL pattern, there 
is randomness in the NN structure. We discuss the differences in practice and theory for cither case. The 
distinction between CSR independence and RL is very important when defining the appropriate null model 
in practice; i.e., the null model depends on the particular ecological context. Goreaud and Pclissicr (2003) 
state that CSR independence implies that the two classes are a priori the result of different processes (e.g., 
individuals of different species or age cohorts), whereas RL implies that some processes affect a posteriori 
the individuals of a single population (e.g., diseased vs. non-diseased individuals of a single species). We 
provide the differences in the proposed tests under these two patterns. For a more detailed discussion of CSR 
independence and RL patterns, sec (Ccyhan (2008d)). 

As clustering alternatives, we consider two major types of spatial patterns: segregation and association. 
Segregation occurs if the NN of an individual is more likely to be of the same class as the individual than to 
be from a different class; i.e., the members of the same class tend to be clumped or clustered (see, e.g., Pielou 
(1961); Dixon (1994); and Coomcs et al. (1999)). For instance, one type of plant might not grow well around 
another type of plant and vice versa. In plant biology, one class of points might represent the coordinates of 
trees from a species with large canopy, so that other plants (whose coordinates are the other class of points) 
that need light cannot grow (well or at all) around these trees. In epidemiology, one class of points might 
be the geographical coordinates of residences of cases and the other class of points might be the coordinates 
of the residences of controls. Furthermore, social and ethnic segregation of residential areas can be viewed 
as a special type of segregation. Given the locations of the residences, the ethnic identity or social status of 
the residents can be viewed as class labels assigned randomly or not. For example, the residents of similar 
social status or same ethnic identity might tend to gather in certain neighborhoods which is an example of 
segregation as opposed to RL of the residences. 

Association occurs if the NN of an individual is more likely to be from another class than to be of the 
same class as the individual. For example, in plant biology, the two classes of points might represent the 
coordinates of mutualistic plant species, so the species depend on each other to survive. As another example. 



the points from one class might be the geometric coordinates of parasitic plants exploiting the other plant 
whose coordinates are the points of the other class. 

The patterns of segregation and association do not only result from multivariate interaction between the 
classes. It is also conceivable to have either of these patterns without any interaction between the point 
processes; for example, consider the case where species happen to have the same or different fine-scale habitat 
preferences. Each of the two patterns of segregation and association are not symmetric in the sense that, when 
two classes are segregated (or associated), they do not necessarily exhibit the same degree of segregation (or 
association). For example, when points from each of two classes labeled as X and Y are clustered at different 
locations, but class X is loosely clustered (i.e., its point intensity in the clusters is smaller) compared to class 

Y so that classes X and Y are segregated but class Y is more segregated than class X. Similarly, when class 

Y points are clustered around class X points but not vice versa, classes Y and X are associated, but class 

Y is more associated with class X compared to the other way around. Although it is not possible to list all 
of the many different types of segregation (and association), its existence can be tested by an analysis of the 
NN relationships between the classes (Pielou (1961)). 

3 Nearest Neighbor Contingency Tables 

NNCTs are constructed using the NN frequencies of classes. The construction of NNCTs for two classes is 
described here; extension to multi-class case is straightforward. Consider two classes with labels 1,2 which 
stand for classes X and F, respectively. Let rii be the number of points from class j for i e {1, 2} and n be 
the total sample size. If the class of each point and the class of its NN were recorded, the NN relationships 
fall into four distinct categories: (1,1), (1,2); (2,1), (2,2) where in cell («,j), class i is the base class, while 
class j is the class of its NN. That is, the n points constitute n (base,NN) pairs. Then each pair can be 
categorized with respect to the base label (row categories) and NN label (column categories). Denoting Nij 
as the frequency of cell (i.e., the count of all (base,NN) pairs each of which has label (i, j)) for i,j G {1, 2} 
yields the NNCT in Table 1 where the column sum Cj is the number of times class j points serve as NNs 
for j G {1,2}. Furthermore, Nij is the cell count for cell {i,j) that is the sum of all (base,NN) pairs each of 
which has label (i, j). Notice that n ~ J2i j ^ij' ~ ! ^'^d Cj = X]i=i ^ij- By construction, if 

Nij is larger (smaller) than expected, then class j serves as NN more (less) to class i than expected, which 
implies (lack of) segregation ii i — j and (lack of) association of class j with class i li i ^ j . Furthermore, we 
adopt the convention that variables denoted by upper (lower) case letters are random (fixed) quantities. 
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Table 1: NNCT for two classes. 

Observe that column sums and cell counts are random, while row sums and the overall sum arc fixed 
quantities in a NNCT. Under segregation, the diagonal entries, Na for i = 1,2, tend to be larger than 
expected; under association, the off-diagonals tend to be larger than expected. The general alternative is that 
some cell counts are different than expected under CSR independence or RL. 

(Pielou (1961)) suggested the use of the Pearson's test of independence for NNCTs, but her test has 
been shown to be inappropriate in literature (see, e.g., Meagher and Burdick (1980)). The main problem 
with her test is that sampling distribution of the cell counts in the NNCTs is not correct. The assumption 
for the use of test for NNCTs is the independence between cell-counts (and rows and columns also) , which 
is violated for CSR independence or RL data (see Dixon (1994) and Ceyhan (2008b)). Dixon (1994) derived 
the appropriate (asymptotic) sampling distribution of cell counts under RL, hence his test is appropriate for 
CSR independence (Ceyhan (2008b)). Nevertheless, Ceyhan (2008b) demonstrated that all these tests are 
consistent, in the sense that under any alternative (of segregation or association), the power tends to one, as 
sample sizes tend to infinity. While Dixon's test has the appropriate nominal size under CSR independence, 
Pielou's test is liberal (Ceyhan (2008b)). Ceyhan (2006) also suggested the use of Fisher's exact test for 
NNCTs and evaluated its variants and the exact version of Pearson's test. 



4 Directional Segregation Tests Based on NNCTs 



In this section, we describe the samphng distribution of ceU counts for NNCTs, discuss directional versions 
of the cell-specific test for cell (1,1), directional version of Pielou's test of segregation, and introduce new 
directional tests of segregation. 



4.1 Dixon's Cell-Specific Test of Segregation 

Dixon (1994) proposed a series of tests for segregation based on NNCTs. In Dixon's framework, the probability 
of a class j point serving as a NN of a class i point depends only on the class sizes (row sums) , but not the 
total number of times class j serves as a NN (column sums). The level of segregation is tested by comparing 
the observed cell counts to the expected cell counts under RL of points that are fixed or a realization of 
points from CSR independence. Dixon demonstrated that under RL, one can write down the cell frequencies 
as Moran join count statistics (Moran (1948)). He then derived the means, variances, and covariances of the 
cell counts (frequencies) (Dixon (1994, 2002a)). 

The null hypothesis under CSR independence or RL is given by 

i^:E[A^,].H"-l)/;';-l) (1) 

where rii is the sample size for class i. Observe that the expected cell counts depend only on the size of each 
class (i.e., row sums), but not on column sums. Furthermore, 



{n + R)pii + {2n-2R + Q) Piii + (n^ - 3n - Q + R)piui - [npuY if i j, 
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with Pxx, Pxxxi a-iid Pxxxx StTC the probabilities that a randomly picked pair, triplet, or quartet of points, 
respectively, are the indicated classes and are given by 
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Furthermore, Q is the number of points with shared NNs, which occur when two or more points share a NN 
and R is twice the number of reflexive pairs. A (base,NN) pair {u, v) is reflexive if {v, u) is also a (base,NN) 
pair. Then Q = 2 {Q2 + 3 Q3 + 6 Q4 + 10 Q5 + 15 Q%) where Qk is the number of points that serve as a NN 
to other points k times. 



The test statistic suggested by Dixon is given by 



Z^^ = ^™ (4) 



where E[iVjj] is given in Equation (1) and Var[iVy] is given in Equation (2). One-sided and two-sided tests 
are possible for each cell using the asymptotic normality of Z^* given in Equation (4) (Dixon (1994)). 
In fact, asymptotic normality of the cell counts are rigorously proved by Dixon (1994) using the technique 
proposed by Cuzick and Edwards (1990) for the diagonal entries. For the two-class case, the asymptotic 
normality of the off-diagonal entries follow trivially; for the multi-class case, normality does only generalize 
to the diagonal entries, but Monte Carlo simulations suggest the normality of off-diagonal entries also (Dixon 
(2002a)). 

Remark 4.1. There are two major types of asymptotic structures for spatial data (Lahiri (1996)). In the 
first, any two observations are required to be at least a fixed distance apart, hence as the number of obser- 
vations increase, the region on which the process is observed eventually becomes unbounded. This type of 



sampling structure is called "increasing domain asymptotics" . In the second type, the region of interest is 
a fixed bounded region and more or more points are observed in this region. Hence the minimum distance 
between data points tends to zero as the sample size tends to infinity. This type of structure is called "infill 
asymptotics", due to Cressie (1993). The sampling structure in our asymptotic sampling distribution could 
be either one of infill or increasing domain asymptotics, as we only consider the class sizes and the total 
sample size tending to infinity regardless of the size of the study region. □ 



4.2 Cell-Specific Tests of Segregation in Ceyhan (2008c) 

In standard cases like multinomial sampling with fixed row totals and conditioning on the column totals, 
the expected cell count for cell in contingency tables is E[iVy] = NiCj/n. We first consider the 

difference Ay := Nij — Ni Cj/n for cell («, j). Notice that under RL, Ni — rii are fixed, but Cj are random 

^ fi ■ ( fi ■ — \\ fi^ 
quantities and Cj = 'ELi^ij^ ^cnce A,j = % - Then under RL, E[Ay] = — = 
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j) + 7^ j) (Ceyhan (2008d)). 
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For 7 = 1 and j = 2, 
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Notice that the expected value of is not zero under RL. Hence, instead of Ay, we suggest the following 
test statistic: 



Then E[T,j] = 0, since for i=j, 



and for i ^ j , 
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Furthermore, for i = 1 and j = 2, 
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Likewise E[T2i] = and E[T22] = 0. For the variance of Ty , we have 

/ Var[Ary] + (-zi^Var[Q] - 2^Cov[7Vy , C,] if ^ = ,, 
Var[JyJ — < (8j 
[ Var [iVy] + ^^Var [Q] - 2j^Cov[N,, , if z ^ j , 

where Var[iVy] are as in Equation (2), Var[Cj] = X^Li '^^^Wv]+J2k^z J2i Cov[N,^,Nkj] and Cov[iVij, Q] = 
Efc=i Cov[iVy,7Vfcj] with Cov[7Vy, iVfc;] arc as in Equations (4)-(12) of Dixon (2002a). The proposed cell- 
specific test in standardized form is 

ZC = . (9) 

" /VaiW 

Recall that in the two-class case, each cell count Nij has asymptotic normal distribution (Cuzick and 
Edwards (1990)). Hence, also converges in law to A'^(0, 1) as n — > oo. Moreover, one and two-sided 
versions of this test are also possible. 

Under CSR independence, the distribution of the test statistics above is similar to the RL case. The only 
difference is that Zf< asymptotically has A^(0, 1) distribution conditional on Q and R. 

Dixon's cell-specific test in Equation (4) depends on the frequencies of (base, NN) pairs (i.e., cell counts), 
and measures deviations from expected cell counts. On the other hand, Ceyhan's cell-specific test in Equation 
(9) is the difference of cell counts and column and row sums; in fact, it can be seen as a difference of two 
statistics and has zero expected value for each cell. 

In the two-class case, segregation of class i from class j implies lack of association between classes i and j 
{i 7^ j) association between classes i and j implies lack of segregation between them {i ^ j), since Zj^ = —Z:l^ 
for i = 1, 2. The same holds for the new cell-specific tests, since = — -^2j for j = 1, 2. 



4.3 Directional Version of Pielou's Test of Segregation 

Pielou (1961) constructed NNCTs based on NN frequencies which yield tests that are independent of quadrat 
size (see also Krebs (1972)) for two classes. She used Pearson's test of independence to detect any deviation 
from randomness in NN structure. The corresponding test statistic is given by 

^P^y (10) 

i=i j=i ^ 



where E[Afy] = (n^ Cj)/n with Cj being the observed sum for column j. Under CSR independence or RL, this 
test is hberal, i.e., has larger Type I error rate than the desired level (Ceyhan (2008b)). 

Piclou's test, when used for a NNCT based on a random sample of (base, NN) pairs, measures deviations 
from the independence of cell counts, but docs not indicate the direction of the deviation (e.g., segregation 
or association). To determine the direction, one needs to check the NNCT. Since Xl for large n, 

we can write Xp = Z|, where Zp '^pJJJ™ -/V(0, 1), where A^(0, 1) stands for the standard normal distribution. 
By some algebraic manipulations, among other possibilities, Zp can be written as 
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See (Bickel and Doksum (1977)) for the sketch of the derivation. For example, Zp could also be written as 
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We point out that these directional tests are not appropriate for testing CSR independence or RL, due to 
inherent dependence of cell counts in NNCTs for such patterns, but only appropriate for NNCTs based on a 
random sample of (base,NN) pairs. 



4.3.1 Empirical Correction of Directional Version of Pielou's Test 

We demonstrate in Section 5 that the directional and two-sided tests based on Zp (i.e., the directional version 
of Pielou's test) are both liberal in rejecting the null hypothesis. We adjust this test for location and scale based 
on Monte Carlo simulations to render it have the desired level, a. For the null case, we simulate the CSR inde- 
pendence pattern only, with classes 1 and 2 of sizes ni and 712, respectively. At each of Nmc = 10000 replicates, 
under Ho, we generate data for the pairs of (711,712) € {(10, 10), (10, 30), (10, 50), (30, 30), (30, 50), (50, 50), (100, 100), (200, 200)} 
points iid from Z^((0, 1) x (0, 1)), the uniform distribution on the unit square. These sample size combinations 
are chosen so that we can see the influence of small and large samples, and of the differences in the relative 
abundances on the tests. 

We record Zp values at each Monte Carlo replication for each sample size combination. In Figure 1, we 
present the kernel density estimates for directional Z-tests and the standard normal density (solid line) in 
order to make distributional comparisons. Observe that for balanced sample size combinations, there seems 
to be a need for scaling, and a mild adjustment in location; while for unbalanced sample size combinations, 
the location discrepancy between standard normal and kernel density estimates seem to be larger. 

We tabulate the sample means and variances of the test statistics in Table 2, where Zp is for the statistic 
defined in Equation (11). Since the empirical mean for Zp, although tending to zero for large ti, is negative 
for small samples, we also consider Zf, = Zp I{Zp < 0) for the association, and Zf, = Zp l{Zp > 0) for the 
segregation alternatives separately, where I(-) stands for the indicator function. 



Empirical Means and Variances of the Test Statistics 
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Table 2: The empirical means and variances for the test statistics. 

Let M{Zp) be the sample mean and V{Zp) be the sample variance of Zp values. Let M{Zp) and V{Zp), 
M{Zp) and V[Zp) be similarly defined for Zp and Zp values, respectively. Then Table 2 suggests that 
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Figure 1: The density plot of standard normal distribution (solid line) and the kernel density estimates of the 
directional z-scores for similar (left) and very different (right) sample size combinations. 

M(Z°) = -1.00 and ViZ'^) = 0.62; while M(^|,) = 0.96 and T^(^|,) = 0.59. On the other hand, M{Zp) 
does not stabilize at a value, but tends to zero; and V{Zp) is about 1.63. Since, we use the critical values 
based on standard normal distribution, after adjusting we want Zp ^ Z, Zp ^ Z^ and Zp ~ Z+, where 
Z- = ZI{Z < 0) and Z+ = ZI{Z > 0) with Z - iV(0, 1). Therefore Zp does not need adjusting for location 
(wc take the mean of Zp to be 0), but needs an adjustment in scale for variance. We transform the Zp scores 
as Z„ic := Zp//3n so that Var[Zr„c] = Var[Z] = 1. By numerical integration, we find the mean and variances 
of Z- and Z+ as E[Z+] = -E[Z-] = 0.798 and Var[Z+] = Va.r[Z-] = 0.363 (by symmetry). We transform 
the Zp and Zp scores by adjusting for location and scale as Z^^^^ := [Zp — aa) I Pa and Z!^^ :— [Zp — as) / (is 
so that E[Z^J « E[Z-] = -0.798 and Var[Z^J « Var[Z-] = 0.363; and E[Zf„J « E[Z+] = 0.798 and 
Var[Z^^] w Var[Z+] = 0.363 would hold. Such transformations will convert the Zp into a standard normal 
(approximately); and Zf, and Zp values into restricted standard normal (approximately), since the Figure 
1 suggests that Zp scores are approximately normal, but not standard normal. Using the sample estimate 
V{Zp) for Var(Zp). solving for /?„ yields /3„ ~ \/L63 = 1.277. Similarly, using the sample estimates 
M{Zf,) and V{Zf,) for E(Z|,) and Var(Z|,), solving for Ua and yields f3a ^ ^y(Z|,)/0.363 = 1.307 and 
aa = 0.043. Likewise, we find f3s = ^/V{Zf,)/ 0.363 = 1.275 and as = -0.057. 

4.4 New Directional Tests of Segregation 

In the directional version of Pielou's test using Zp in Equation (11), the quantities ni, n2 and n are fixed, 
while Ci, C2, and Nn, N21 are random quantities. Because of the fact that the product C1G2 is in the 
denominator under the square root, E[2'p] and Var[Zp] arc not analytically tractable, hence the correct 
(asymptotic) distribution of Zp is not available. Therefore, one way to fix Zp to have the appropriate 
Type I error rate is as in Section 4.3.1 or by randomization test. Meagher and Burdick (1980) propose and 
illustrate using Monte Carlo simulation to calculate critical values of Pielou's test statistic under RL. This is 
the same concept as our randomized (Monte Carlo corrected) test, but the details are not the same: Meagher 
and Burdick (1980) use a Monte-Carlo computation of the critical value while we use a Monte Carlo-based 
moment adjustment that is intended for use when the study region is rectangular and sample sizes are similar. 

Alternatively we modify Zp to make the distribution of it to be analytically tractable. To this end, let 

T^,= ^.^ and C/„ := .Z^^, then Zp = x/^[/„r„. Note that E[T„] = ("MM _ MM) = 
ni 712 S C1C2 \ ni n2 ) 



. Using the asymptotic normality of cell counts A^n and N21, we have 



(n-1) 

Tn - E[T„] \ c 



iV(0,l), 



where — > stands for convergence in law 



Var[T„] = 



VVar[r„] J 
Var[iVii] Var[7V2i] 2 Cov[iVii, iVzi] 



"1 



ni 712 



with Var[iVy] are as in Equation (2) and 

Cov[iVii,iV2i] = {n - R + Q)pii2 + {n^ - 3n - Q + R) P1112 - Pii P12 
(see Dixon (2002a) for the derivation). 
We propose two tests based on T„: 

The latter test statistic, Zu converges in distribution to A'^(0, 1) as n — > 00; while such convergence for 
Zi holds conditional on [/„■ However, note that for large n, [/„ w 1, since, letting i>i be the proportion of 
class i in the population, we have lim„.„.^oo ni/n = Vi\ and under CSR independence or RL, Ci/n Vi as 
n,ni —> 00 for i = 1,2. Furthermore, Zi and Zjj values are positive under segregation and negative under 
association alternatives. 

Remark 4.2. Extension to Multi-Class Case: For q classes, the NNCT we obtain will be of dimension q x g. 
The extension of the cell-specific tests to the g-class case with g > 2 is straightforward. But unfortunately, 
the asymptotic normality of the off-diagonal cells in these NNCTs is not rigorously established yet, although 
extensive Monte Carlo simulations indicate approximate normality for large samples (Dixon (2002a)). In 
the multi-class case, a positive z-score, Za, for the diagonal cell (i,i) indicate segregation, but it does not 
necessarily mean lack of association between class i and class j (i ^ j), since it could be the case that class i 
could be associated with another class, yet not associated with another one. Likewise for Ceyhan's cell-specific 
tests. The directional tests in Section 4.4 are designed for the two-class case only. □ 

Remark 4.3. The Status of Q and R under CSR Independence and RL: Q and R are fixed under 
RL, but random under CSR independence. The quantities given in Equations (1), (2), and all the quantities 
depending on these expectations also depend on Q and R. Hence these expressions are appropriate under the 
RL model. Under the CSR independence model they are conditional variances and covariances obtained by 
conditioning on Q and R. Hence under the CSR independence pattern, the asymptotic distributions of the 
tests in Equations (4), (9), and (12) are conditional Q and R. 

The unconditional variances and covariances can be obtained by replacing Q and R with their expectations 
(Ceyhan (2009)). Unfortunately, given the difficulty of calculating the expectation of Q under CSR indepen- 
dence, it is reasonable and convenient to use test statistics employing the conditional variances and covariances 
even when assessing their behavior under the CSR independence model. Cox (1981) calculated analytically 
that E[i?|iV] = 0.6215A^ for a planar Poisson process. Alternatively, one can estimate the expected values of 
Q and R empirically. For example, for homogeneous planar Poisson pattern, we have E[Q|A^] w 0.6328A^ and 
E[i?|A^] w 0.6211A^ (estimated empirically by 1000000 Monte Carlo simulations for various values of N on 
unit square). Notice that E[i?|A^] agrees with the analytical result of Cox (1981). When Q and R are replaced 
by 0.63 71 and 0.62 ri, respectively, the so-called QR-adjusted tests are obtained. However, QR-adjustment 
does not improve on the unadjusted NNCT-tests (Ceyhan (2008e)). □ 



5 Empirical Significance Levels under CSR Independence 



We only consider the two-class case with classes X and Y. We generate ni points from class X and n2 points 
from class Y both of which are uniformly distributed on the unit square (0, 1) x (0, 1) for some combinations 



of rii and 712. Thus, we simulate the CSR independence pattern for the performance of the tests under the 
null case. 

We present the empirical significance levels of the tests for the two-sided alternative in Table 3, where afj 
and Spj arc the empirical significance levels of Dixon's and Ceyhan's cell-specific tests for cell (i, i), i = 1, 2, 
respectively, is for the directional version of Pielou's test Zp, is for the empirically corrected version 
of Zp, aj and an are for the new directional tests provided in Equation (12). The sizes significantly smaller 
(larger) than .05 are marked with (^), which indicate that the corresponding test is conservative (liberal). 
The asymptotic normal approximation to proportions is used in determining the significance of the deviations 
of the empirical sizes from .05. For these proportion tests, we also use a = .05 as the significance level. With 
Nmc = 10000, empirical sizes less than .0464 are deemed conservative, greater than .0536 are deemed liberal 
at a = .05 level. Notice that directional version of Pielou's test Zp is extremely liberal in rejecting the null 
hypothesis, similar to the two-sided version as shown in Ceyhan (2008b). The empirically corrected version 
of Pielou's test has much better size performance compared to Zp, but it is still extremely conservative 
when relative abundances are very different for small samples (i.e., Ui < 30 for both i = 1,2); for similar 
small samples it is liberal; and when the relative abundances are very diff'erent for large samples it is also 
conservative. For Dixon's cell-specific tests, if at least one sample size is small, the normal approximation is 
not appropriate, so he recommends Monte Carlo randomization instead of the asymptotic approximation for 
the corresponding cell-specific tests (Dixon (1994)). For cell (1, 1), when ni < 10 or when ni and n2 are very 
different (i.e., classes have very different relative abundances), the cell count is more likely to be < 5. Hence 
in such cases Dixon's cell-specific test is conservative when rii is small, and is liberal when ni is large. The 
empirical size for Dixon's test for cell (2,2) at sample size combination (711,712) is similar to the one for cell 
(1, 1) at (712, Til), so Dixon's cell-specific tests are symmetric in the sample sizes in terms of size performance. 
When cell counts are > 5 (which happens for large samples with relative abundances not being very different), 
Dixon's cell-specific tests seem to be appropriate (i.e., they have about the desired nominal level). On the 
other hand, Ceyhan's cell-specific test seems to be conservative when both sample sizes are small {rii < 30) 
or the classes have very different relative abundances. Otherwise, they have about the desired nominal level. 
Furthermore, the empirical size estimates of Ceyhan's cell-specific tests for cells (1, 1) and (2, 2) are similar at 
each sample size combination. The size performance of the new directional tests is similar to that of Ceyhan's 
cell-specific tests. 

The differences in the relative abundance of classes seem to affect Dixon's tests more than the other tests. 
See for example cell-specific tests for cell (1,1) for sample sizes (30,50) and (50,100), where Dixon's test 
suggests that class X (i.e., class with the smaller size) is more segregated which is only an artifact of the 
difference in the relative abundance. On the other hand, Ceyhan's cell-specific tests and the new directional 
tests are more robust to differences in the relative abundances. 







Empirical significance levels of the tests 




sizes 






for the two-sided alternatives 






("1,712) 














aj 


an 


(10,10) 


.0454^= 


.0465 


.0452= 


.0459= 


.1280*^ 


.0608^ 


.0503 


.0439= 


(10,30) 


.0306'= 


.0485 


.0413= 


.0420= 


.1429*^ 


.0320= 


.0390= 


.0410= 


(10,50) 


.0270^= 


.0464 


.0390= 


.0396= 


.0664'' 


.0292= 


.0423= 


.0397= 


(30,10) 


.0479 


.0275= 


.0399= 


.0395= 


.1383'' 


.0282= 


.0372= 


.0389= 


(30,30) 


.0507 


.0505 


.0443= 


.0442= 


.1339^ 


.0552^' 


.0465 


.0427= 


(30,50) 


.0590'^ 


.0522 


.0505 


.0510 


.1267^ 


.0531 


.0502 


.0505 


(50,10) 


.0524 


.0263= 


.0378= 


.0367= 


.0654^ 


.0287= 


.0406= 


.0379= 


(50,30) 


.0535 


.0597*^ 


.0462= 


.0476 


.1275^ 


.0534 


.0474 


.0464 


(50,50) 


.0465 


.0469 


.0500 


.0502 


.1397^ 


.0494 


.0520 


.0499 


(50,100) 


.0601^^ 


.0533 


.0514 


.0515 


.1223^ 


.0508 


.0506 


.0519 


(100,50) 


.0490 


.0571*^ 


.0480 


.0477 


.1190^ 


.0463= 


.0470 


.0483 


(100,100) 


.0493 


.0463= 


.0485 


.0486 


.1324^ 


.0524 


.0490 


.0489 



Table 3: The empirical significance levels of the tests for the two-sided alternatives under Ho : 
CSR independence with N^c = 10000, 7ii,7i2 in {10,30,50,100} at a = .05. (=: the empirical size is 
significantly smaller than 0.05; i.e., the test is conservative. ^: the empirical size is significantly larger than 
0.05; i.e., the test is liberal. 



We present the empirical significance levels of the tests for the right-sided alternative (i.e., with respect 
to the segregation alternative) in Table 4. The size labeling and superscripting for conservativeness and 
libcralness are as in Tabic 3. Notice that directional version of Piclou's test for segregation alternative is 
liberal in rejecting the null hypothesis. The empirically corrected version of Piclou's test as in Section 4.3.1 
for the segregation alternative is about the desired level for larger samples, but is conservative or liberal for 
smaller samples. The size performance of Dixon's cell-specific test for cell (1, 1) at (rii,n2) is similar to that 
for cell (2,2) at (n2,ni). On the other hand, at each (71,1,712) the size performance of Ceyhan's cell-specific 
test for cell (1, 1) is similar to that for cell (2,2). Dixon's cell-specific tests are usually liberal, in particular 
for the smaller sample for different relative abundance cases. Ceyhan's cell-specific tests are liberal when 
Tii < 30 for both i = 1, 2 or when the relative abundances are very different. The size performance of the 
new directional tests is similar to that of Ceyhan's cell-specific tests. As in the two-sided version, Ceyhan's 
cell-specific and the new directional tests are more robust to differences in relative abundance of the classes. 







Empirical significance levels of the tests 




sizes 




for the 


segregation (i.e., right-sided) alternatives 




{ni,n2) 




"2,2 


^1,1 






^rnc 


S/ 


aii 


(10,10) 


.0515 


.0489 


.0491 


.0491 


.0844' 


.0422^ 


.0526 


.0499 


(10,30) 


.0960^^ 


.0468 


.0631' 


.0643' 


.0846' 


.0576' 


.0613' 


.0651' 


(10,50) 


.0936' 


.0435^ 


.0684' 


.0677*^ 


.0947' 


.0548' 


.0693' 


.0678' 


(30,10) 


.0430^= 


.0900' 


.0571' 


.0567' 


.0760' 


.0511 


.0545' 


.0575' 


(30,30) 


.0490 


.0530 


.0556' 


.0555' 


.0803' 


.0557' 


.0557' 


.0555' 


(30,50) 


.0652*^ 


.0479 


.0482 


.0484 


.0792' 


.0445= 


.0479 


.0480 


(50,10) 


.0441^= 


.0915' 


.0655' 


.0665' 


.0955' 


.0531 


.0682' 


.0655' 


(50,30) 


.0492 


.0664' 


.0515 


.0509 


.0829' 


.0468 


.0511 


.0511 


(50,50) 


.0577*^ 


.0546' 


.0514 


.0509 


.0804' 


.0421= 


.0526 


.0522 


(50,100) 


.0571*^ 


.0464 


.0509 


.0508 


.0921' 


.0495 


.0524 


.0508 


(100,50) 


.0434^= 


.0584' 


.0499 


.0500 


.0909' 


.0483 


.0512 


.0498 


(100,100) 


.0515 


.0500 


.0485 


.0485 


.0927' 


.0484 


.0485 


.0485 



Table 4: The empirical significance levels of the tests for the segregation (right-sided) alternatives under 
Ho : CSR independence with Nmc = 10000, ni,n2 in {10,30,50, 100} at a = .05. (=: the empirical size is 
significantly smaller than 0.05; i.e., the test is conservative. ': the empirical size is significantly larger than 
0.05; i.e., the test is liberal. 

We present the empirical significance levels of the tests for the left-sided alternative (i.e., with respect 
to the association alternative) in Table 5. The size labeling and superscripting are as in Table 3. Notice 
that directional version of Pielou's test is extremely liberal for all sample size combinations. The empirically 
corrected version is still liberal for most sample sizes and extremely conservative for (10, 50) and (50, 10) 
cases. When the relative abundances of classes are very different (see (10, 50) and (50, 10) cases), both tests 
are severely affected, but the corrected version is extremely conservative. For small samples (n^ < 30) Dixon's 
cell-specific tests are extremely conservative for the cell associated with the smaller sample when the relative 
abundances are very different. For large samples Dixon's cell-specific tests are liberal for the cell associated 
with the smaller sample when the relative abundances are very different. Ceyhan's cell-specific tests are 
extremely conservative when both samples are small and are about the desired level for large samples. New 
tests' size performance is similar to that of Ceyhan's tests. Furthermore, the effect of the differences in relative 
abundances is most severe on Dixon's ccll-spccific tests. 



6 Empirical Power Analysis 

We consider three cases for each of segregation and association alternatives. Based on the empirical size 
estimates provided in Section 5, we omit the directional versions of Pielou's test Zp and the empirically 
corrected version of it (see Section 4.3.1) from further consideration. Zp is extremely liberal in rejecting 
the null hypothesis, so it is likely to give more false alarms than we can tolerate. On the other hand, the 
empirically corrected version is only valid for rectangular regions for similar sample sizes, hence might miss 
the correct pattern due to these restrictions. 
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Empirical significance levels of the tests 
for the association (i.e., left-sided) alternatives 


("1,«2) 












rnc 


aj 


an 


(10,10) 


.0412'= 


.0455^= 


.0467 


.0454^= 


.1574^ 


.0858^' 


.0484 


.0425^= 


(10,30) 


.OOOO'^ 


.0490 


.0342^= 


.0362^= 


.1399^ 


.0600^' 


.0296^= 


.0362^= 


(10,50) 


.OOOO'^ 


.0484 


.0057^= 


.0087^= 


.0574^ 


.0006^= 


.0006^= 


.0086^= 


(30,10) 


.0494 


.0000^= 


.0333^= 


.0319'= 


.1406*^ 


.0556^ 


.0274^= 


.0332^= 


(30,30) 


.0450^= 


.0430^= 


.0504 


.0505 


.1115'^ 


.0537^ 


.0505 


.0505 


(30,50) 


.0611^ 


.0564 


.0494 


.0494 


.1172*^ 


.0600^ 


.0493 


.0495 


(50,10) 


.0545 


.0000^= 


.0080^= 


.0058^= 


.0544*^ 


.0004^= 


.0004^= 


.0079^= 


(50,30) 


.0520 


.0594*^ 


.0475 


.0479 


.1173*^ 


.0572* 


.0467 


.0477 


(50,50) 


.0486 


.0494 


.0503 


.0500 


.1041*^ 


.0580* 


.0522 


.0517 


(50,100) 


.0548^ 


.0491 


.0487 


.0491 


.1090*^ 


.0534 


.0475 


.0486 


(100,50) 


.0485 


.0515 


.0465 


.0464 


.1063*^ 


.0515 


.0453^= 


.0464 


(100,100) 


.0478 


.0493 


.0475 


.0475 


.1092*^ 


.0592* 


.0476 


.0475 



Table 5: The empirical significance levels of the tests for the association (left-sided) alternatives under Ho : 
CSR independence with Nmc — 10000, ni,n2 in {10,30,50,100} aX a — .05. {'^: the empirical size is 
significantly smaller than 0.05; i.e., the test is conservative. ^: the empirical size is significantly larger than 
0.05; i.e., the test is liberal. 

6.1 Empirical Power Analysis under Segregation Alternatives 

For the segregation alternatives, we generate X, ~ U{{Q, 1 - s) x (0, 1 - s)) and Y, " U{{s, 1) x (s, 1)) for 
i = 1, . . . , ni and j = 1, . . . , 712. Notice the level of segregation is determined by the magnitude of s e (0, 1). 
We consider the following three segregation alternatives: 

i7|:s = 1/6, : s = 1/4, and : s = 1/3. (13) 

Observe that, from Hg to Hg'^ , the segregation gets stronger in the sense that X and Y points tend to 
form one-class clumps or clusters. 

The power estimates for the two-sided versions and right-sided versions under segregation alternatives are 
presented in Tables 6 and 7, and plotted in Figures 2 and 3, respectively, where f]^^ and (3^^^ are the empirical 

power estimates for Dixon's and Ceyhan's cell-specific tests for cell {i, i), for i = 1, 2 and (3j and /3j7 are for 
the new directional tests. We omit the power estimates of the tests for the left-sided alternative under the 
segregation alternatives as they are virtually zero. Observe that, for all directional tests, as n = (ni + n2) gets 
larger, the power estimates get larger under each segregation alternative; for the same n = (ni -1-712) values, the 
power estimate is larger for classes with similar sample sizes; and as the segregation gets stronger, the power 
estimates get larger at each (rii, 712) combination. The power estimates for the right-sided tests are all higher 
than their corresponding two-sided estimates (as expected). The power estimates for Ceyhan's cell-specific 
tests and the new versions of the directional tests are similar and are higher than those for Dixon's cell-specific 
tests. Furthermore, version I of the new directional tests seems to have the highest power estimates. 

Considering the empirical significance levels and power estimates, we recommend the version I of the new 
directional tests {Zj) in the right-sided form when testing against the segregation alternatives, as it is at the 
desired level for similar sample sizes, slightly conservative for very different sample sizes, but have higher 
power for each sample size combination. On the other hand, Zj is a conditional test (conditional on column 
sums), while Zjj is unconditional and the empirical size and power estimates are about the same as Zj. Hence 
Zji can also be used instead. 

6.1.1 Empirical Power Analysis under Association Alternatives 

For the association alternatives, we consider three cases. First, we generate X; '-^ U{{0,1) x (0,1)) for 
i = 1, 2, . . . , ni. Then we generate Yj for j ' = 1, 2, . . . , 7^2 as follows. For each j, we pick an i randomly, then 
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Figure 2: The empirical power estimates for Dixon's and Ceyhan's cell-specific tests for cell {i, i), i = 1,2 and 
the new directional tests under the segregation alternatives in the two-class case for the two-sided alternatives. 
/3.f^; and stand for Dixon's and Ceyhan's cell-specific tests for cell i = 1,2, respectively, and /3i and 
Pjj stand for the versions I and II of the new directional tests of segregation. The horizontal axis labels are 
1=(10,10), 2=(10,30), 3=(10,50), 4=(30,30), 5=(30,50), 6=(50,50), 7=(50,100), 8=(100,100). 




Figure 3: The empirical power estimates for Dixon's and Ceyhan's cell-specific tests for cell (z, i), i — 1,2 and 
the new directional tests under the segregation alternatives in the two-class case for the right-sided alternatives 
(which is sensitive for the segregation pattern). The power and horizontal axis labeling is as in Figure 2. 



Empirical power estimates for the two-sided tests 
under the segregation alternatives 




{ni,n2) 










Pi 


Pii 




(10,10) 


.0734 


.0698 


.1068 


.1060 


.1108 


.1026 


(10,30) 


.1436 


.1540 


.1977 


2019 


.2173 


.1973 


(10,50) 


.1639 


.1615 


.2465 


.2491 


.2835 


.2490 


(30,30) 


.2883 


.2783 


.3898 


.3894 


.3932 


.3837 


(50,10) 


.1636 


.1520 


.2395 


.2367 


.2741 


.2395 


(50,50) 


.5091 


.5016 


.6786 


.6793 


.6847 


.6831 




(10,10) 


.2057 


.2044 


.3280 


.3270 


.3322 


.3203 


(10,30) 


.4601 


.4133 


.5725 


.5793 


.6082 


.5724 


(10,50) 


.5420 


.4477 


.6747 


.6794 


.7129 


.6793 


(30,30) 


.7783 


.7769 


.8939 


.8938 


.8946 


.8917 


(50,10) 


.4453 


.5383 


.6756 


.6721 


.7116 


.6754 


(50,50) 


.9543 


.9551 


.9938 


.9936 


.9938 


.9938 




(10,10) 


.5144 


.5121 


.7324 


.7320 


.7327 


.7257 


(10,30) 


.8873 


.7833 


.9402 


.9425 


.9514 


.9400 


(10,50) 


.9353 


.8002 


.9699 


.9711 


.9754 


.9711 


(30,30) 


.9929 


.9915 


.9990 


.9990 


.9990 


.9990 


(50,10) 


.7989 


.9393 


.9720 


.9712 


.9772 


.9720 


(50,50) 


.9999 


1.000 


1.000 


1.000 


1.000 


1.000 



Table 6: The empirical power estimates for the two-sided tests under the segregation alternatives, iJg, -ffg^, 
and Hg^^ for the two-class case with N^c — 10000, for some combinations of rii, n2 € {10, 30, 50} at a = .05. 



generate Yj as Xi + Rj (cos Tj, sin Tj)' where Rj r) with r e (0, 1) and Tj U{0, 2 tt). In the pattern 

generated, appropriate choices of r will imply association between classes X and Y. That is. it will be more 
likely to have {X,Y) or {Y,X) NN pairs than same-class NN pairs (i.e., (X,X) or {Y,Y)). The three values 
of r we consider constitute the following three association alternatives; 

i/^:r = 1/4, : r = 1/7, and iJ^^^ : r = 1/10. (14) 

Observe that, from i?^ to i?^^^, the association gets stronger in the sense that X and Y points tend to occur 
together more and more frequently. By construction, for similar sample sizes the association between X and 
Y are at about the same degree as association between Y and X. For very different sample sizes, smaller 
sample is associated with the larger but the abundance of the larger sample confounds its association with 
the smaller. 

The power estimates for the two-sided versions and left-sided versions under association alternatives are 
presented in Tables 8 and 9, and plotted in Figures 4 and 5, respectively, where power labeling is as in Section 
6.1. We omit the power estimates of the tests for the right-sided alternative under the association alternatives 
as they are virtually zero. Observe that when sample sizes are similar (see ni = n2 cases), for all tests, as 
n = (?ii + 712) gets larger, the power estimates get larger; and as the association gets stronger, the power 
estimates get larger. The power estimates for the left-sided tests are all higher than their corresponding 
two-sided estimates (as expected). For such sample sizes, version I has the highest power estimates. 

For smaller samples, Dixon's test has the highest power, while for larger samples new versions and Ceyhan's 
test have similar power performance, but they have higher power compared to Dixon's tests. The power 
performance is highly dependent on the level of relative abundances of the classes. This might be due to 
the fact that by construction, when class X is much larger than class Y, the two classes are not strongly 
associated, since NN of X points could also be from the same class with a high probability. The lack of 
association when class Y is larger occurs for the same reason. 

Considering the empirical significance levels and power estimates, we recommend the version II of the new 
directional tests (i.e., Zjj) in the left-sided form when testing against the association alternatives, as it is at 
the desired level for most sample sizes, and has considerably higher power for each sample size combination. 
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Figure 4: The empirical power estimates for Dixon's and Ceyhan's cell-specific tests for cell {i, i), i = 1,2 and 
the new directional tests under the association alternatives in the two-class case for the two-sided alternatives. 
The power and horizontal axis labeling is as in Figure 2. 




Figure 5: The empirical power estimates for Dixon's and Ceyhan's cell-specific tests for cell {i, i), i — 1,2 and 
the new directional tests under the association alternatives in the two-class case for the left-sided alternatives, 
(which is sensitive for the association pattern). The power and horizontal axis labeling is as in Figure 2. 
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{ni,n2) 








Pl2 


Pi 


Pii 




(10,10) 


.1495 


.1482 


.1719 


.1707 


.1782 


.1743 


(10,30) 


.3021 


.2401 


.2767 


.2859 


.2939 


.2873 


(10,50) 


.3324 


.2686 


.3226 


.3239 


.3668 


.3239 


(30,30) 


.4313 


.4199 


.5338 


.5342 


.5343 


.5338 


(50, 10) 


.2755 


.3332 


.3226 


.3220 


.3647 


.3226 


(50,50) 


.6863 


.6866 


.7891 


.7895 


.7931 


.7920 




(10,10) 


.3837 


.3719 


.4615 


.4618 


.4724 


.4666 


(10,30) 


.6891 


.5490 


.6788 


.6901 


.6997 


.6908 


(10,50) 


.7494 


.5956 


.7495 


.7514 


.7856 


.7514 


(30,30) 


.8788 


.8761 


.9515 


.9515 


.9515 


.9515 


(50, 10) 


.5995 


.7448 


.7480 


.7454 


.7880 


.7480 


(50,50) 


.9851 


.9864 


.9978 


.9981 


.9982 


.9982 




(10,10) 


.7429 


.7405 


.8386 


.8375 


.8424 


.8399 


(10,30) 


.9670 


.8683 


.9672 


.9706 


.9719 


.9708 


(10,50) 


.9831 


.8968 


.9827 


.9829 


.9866 


.9829 


(30,30) 


.9985 


.9982 


.9997 


.9997 


.9997 


.9997 


(50, 10) 


.8918 


.9836 


.9850 


.9842 


.9893 


.9850 


(50,50) 


1.000 


1.000 


1.000 


1.000 


1.000 


1.000 



Table 7: The empirical power estimates for the right-sided tests under the segregation alternatives, Hg, Hg^ , 
and Hg^^ for the two-class case with Nmc = 10000, for some combinations of rii, n2 € {10, 30, 50} at a = .05. 

7 Example Data 

We illustrate the tests on four example data sets: Pielou's Douglas-fir/ponderosa pine data, Dixon's swamp 
tree data, pyramidal neuron data, and an artificial data set. 

7.1 Pielou's Data 

Pielou used a completely mapped data that is comprised of ponderosa pine {Pinus ponderosa) and Douglas-fir 
trees {Pseudotsuga menziesii formerly P. taxifolia) from a region in British Columbia (Pielou (1961)). Her 
data is also used by Dixon as an illustrative example (Dixon (1994)). The question of interest is whether 
the two tree species are segregated, associated, or do not significantly deviate from CSR independence. The 
corresponding NNCT and the percentages are provided in Table 10. The percentages for the cells are based 
on the sample sizes of each species, that is, for example, % 86 of Douglas-firs have NNs from Douglas firs, and 
remaining % 15 NNs are from ponderosa pines. The row and column percentages are marginal percentages 
with respect to the total sample size. The percentage values are suggestive of segregation for both species. 

The raw data is not available, but fortunately, Pielou (1961) provided Q — 162 and R = 134. The test 
statistics are provided in Table 11, and the corresponding p- values for the two-sided, right-sided (segregation), 
and left-sided (association) alternatives are provided below the test statistics. Observe that all two-sided tests 
are significant, implying significant deviation from CSR independence or RL; and among the directional tests 
right-sided p- values are significant; hence there is significant segregation between Douglas-firs and ponderosa 
pines. 

7.2 Swamp Tree Data 

Dixon illustrates NN-methods on a 50m x 200m rectangular plot of hardwood swamp in South Carolina, 
USA (Dixon (2002b)). The plot contains 13 different tree species, of which we only consider two, namely, 
bald cypress {Taxodium distichum) and black gum trees (Nyssa sylvatica). The question of interest is whether 
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{ni,n2) 




/3?2 


P?.i 




Pi 


Pii 


K 


(10,10) 


.1349 


.1776 


.1638 


.1689 


.1854 


.1663 


(10,30) 


.0002 


.4366 


.2575 


.2728 


.1744 


.2727 


(10,50) 


.0002 


.4947 


.0686 


.1071 


.0041 


.0998 


(30,10) 


.0725 


.0047 


.0294 


.0275 


.0155 


.0292 


(30,30) 


.1413 


.2434 


.2110 


.2134 


.2149 


.2082 


(30,50) 


.1833 


.3984 


.3268 


.3314 


.3240 


.3300 


(50,10) 


.0547 


.0067 


.0122 


.0114 


.0113 


.0120 


(50,30) 


.0882 


.1484 


.1218 


.1224 


.1281 


.1218 


(50,50) 


.1149 


.2421 


.2151 


.2181 


.2209 


.2144 




(10,10) 


.2499 


.2569 


.2898 


.2900 


.3125 


.2871 


(10,30) 


.0000 


.6463 


.4919 


.5123 


.3722 


.5120 


(10,50) 


.0000 


.7062 


.1959 


.2699 


.0177 


.2539 


(30,10) 


.1818 


.0019 


.0770 


.0703 


.0341 


.0769 


(30,30) 


.4053 


.4457 


.5267 


.5293 


.5314 


.5243 


(30,50) 


.4896 


.6957 


.7196 


.7239 


.7154 


.7228 


(50,10) 


.1268 


.0032 


.0109 


.0071 


.0036 


.0102 


(50,30) 


.2957 


.2735 


.3387 


.3363 


.3392 


.3383 


(50,50) 


.4034 


.4961 


.5824 


.5848 


.5904 


.5802 




(10,10) 


.3038 


.2918 


.3475 


.3471 


.3699 


.3448 


(10,30) 


.0000 


.7364 


.6115 


.6290 


.4871 


.6283 


(10,50) 


.0000 


.7907 


.2885 


.3718 


.0340 


.3517 


(30,10) 


.2957 


.0018 


.1371 


.1258 


.0669 


.1371 


(30,30) 


.6092 


.6011 


.7308 


.7301 


.7338 


.7283 


(30,50) 


.7211 


.8491 


.9052 


.9072 


.9027 


.9066 


(50,10) 


.2277 


.0016 


.0162 


.0089 


.0015 


.0144 


(50,30) 


.5414 


.3919 


.5632 


.5588 


.5573 


.5617 


(50,50) 


.6842 


.6891 


.8289 


.8301 


.8330 


.8278 



Table 8: The empirical power estimates for the two-sided tests under the association alternatives, 7?^, Hj^, 
and H^^^ for the two-class case with N^c = 10000, for some combinations of ni, n2 S {10, 30, 50} at a = .05. 

these tree species arc segregated, associated, or satisfy CSR independence. For more detail on the data, see 
(Dixon (2002b)). The locations of these trees in the study region are plotted in Figure 6 and the corresponding 
NNCT together with percentages are provided in Table 12. Observe that the percentages are suggestive of 
segregation for both species. 

The locations of the tree species can be viewed a priori resulting from different processes, so the more 
appropriate null hypothesis is the CSR independence pattern. Hence our inference will be a conditional one 
(see Remark 4.3). We calculate Q = 178 and R = 156 for this data set. We present Dixon's and Ceyhan's 
cell-specific and new directional test statistics and the associated p- values the two-, right-, and left-sided 
alternatives in Table 13, where Pasy stands for the j3-value based on the asymptotic approximation, is the 
p-value based on 10000 Monte Carlo replication of CSR independence in the same plot and Prand is based on 
Monte Carlo randomization of the labels on the given locations of the trees 10000 times. Notice that Pasy, 
Pmc, and Piand Stie very similar for each test. All tests are significant for the two-sided alternative implying 
deviation from CSR independence, and among the directional tests, right-sided tests are significant, indicating 
significant segregation between black gums and bald cypresses. 

The results based on NNCT-tests pertain to small scale interaction at about the average NN distances. 
We might also be interested in the causes of the segregation and the type and level of interaction between the 
tree species at different scales (i.e., distances between the trees). To answer such questions, we also present 
the second-order analysis of the swamp tree data (Diggle (2003)) using the functions (or some modified 
version of them) provided in spatstat package in R (Baddeley and Turner (2005)). We use Ripley's univariate 
and bivariate L-functions which arc modified versions of his Jv-functions. The estimator K{t) of K{t) is 
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Figure 6: The scatter plots of the locations of black gum trees (circles o) and bald cypress trees (triangles 
A). 
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Figure 7: Ripley's univariate L-functions (top row) Lii{t) — t for z = 0, 1, 2, where i = stands for all data 
combined, z = 1 for black gums, and i = 2 for bald cypresses; and pair correlation functions g(t) for all trees 
combined and for each species (bottom row). Wide dashed lines are the upper and lower (pointwise) 95 % 
confidence bounds for the functions based on Monte Carlo simulation under the CSR independence pattern. 
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/3?2 




/3?2 


/?/ 


/?// 


Hi 


(10,10) 


.1893 


.2299 


.2772 


.2826 


.2936 


.2689 


(10,30) 


.0000 


.5551 


.4619 


.4736 


.4307 


.4725 


(10,50) 


.0000 


.6171 


.2991 


.3553 


.0974 


.3547 


(30, 10) 


.1027 


.0000 


.0884 


.0849 


.0763 


.0883 


(30,30) 


.1926 


.3126 


.3421 


.3422 


.3427 


.3423 


(30,50) 


.3076 


.5147 


.4547 


.4572 


.4482 


.4568 


(50, 10) 


.0748 


.0000 


.0172 


.0118 


.0013 


.0172 


(50,30) 


.1373 


.2559 


.2040 


.2070 


.2080 


.2061 


(50,50) 


.1742 


.3361 


.3202 


.3258 


.3296 


.3283 




(10,10) 


.3195 


.3199 


.4514 


.4512 


.4608 


.4423 


(10,30) 


.0000 


.7566 


.6991 


.7106 


.6701 


.7089 


(10,50) 


.0000 


.8083 


.5407 


.5989 


.2500 


.5982 


(30, 10) 


.2631 


.0000 


.2108 


.2013 


.1829 


.2106 


(30,30) 


.4864 


.5312 


.6919 


.6918 


.6923 


.6920 


(30,50) 


.6556 


.7906 


.8293 


.8310 


.8242 


.8306 


(50, 10) 


.1901 


.0000 


.0490 


.0361 


.0054 


.0487 


(50,30) 


.4129 


.4177 


.4737 


.4735 


.4703 


.4747 


(50,50) 


.5194 


.6117 


.7138 


.7146 


.7206 


.7191 




(10,10) 


.3799 


.3610 


.5218 


.5202 


.5295 


.5133 


(10,30) 


.0000 


.8261 


.8034 


.8135 


.7750 


.8127 


(10,50) 


.0000 


.8757 


.6589 


.7128 


.3467 


.7122 


(30, 10) 


.4010 


.0000 


.3090 


.2942 


.2670 


.3087 


(30,30) 


.6856 


.6714 


.8550 


.8550 


.8552 


.8551 


(30,50) 


.8504 


.9096 


.9525 


.9527 


.9503 


.9527 


(50, 10) 


.3215 


.0000 


.0993 


.0771 


.0107 


.0991 


(50,30) 


.6571 


.5503 


.7010 


.6994 


.6924 


.7012 


(50,50) 


.7819 


.7850 


.9026 


.9015 


.9052 


.9045 



Table 9: The empirical power estimates for the Icft-sidcd tests under the association alternatives, ff^, , 
and H^^^ for the two-class case with = 10000, for some combinations of n\,ni € {10, 30, 50} at a = .05. 



approximately unbiased for K{t) at each fixed t. Bias depends on the geometry of the study area and 
increases with t. For a rectangular region it is recommended to use t values up to 1/4 of the smaller side 
length of the rectangle. So we take the values t € [0, 12.5] in our analysis, since the rectangular region is 
50 X 200 m. But Ripley's iC-function is cumulative, so interpreting the spatial interaction at larger distances 
is problematic (Wicgand et al. (2007)). The (accumulative) pair correlation function g(i) is better for this 
purpose (Stoyan and Stoyan (1994)). The pair correlation function of a (univariate) stationary point process 
is defined as 

where K' [t) is the derivative of K[t). However if g[t) > 0, the pair correlation function estimates might 
have critical behavior for small t since the estimator of variance and hence the bias are considerably large. 
This problem gets worse especially in cluster processes (Stoyan and Stoyan (1996)). See for example Figure 
7 where the confidence bands for smaller t values are much wider compared to those for larger t values. So 
pair correlation function analysis is more reliable for larger distances. In particular, it is safer to use g{t) for 
distances larger than the average NN distance in the data set. We can use Ripley's L-function for distances 
up to the average NN distance, or use NNCT-tests for about the average NN distance. 

Ripley's univariate i-functions and the pair correlation functions for both species combined and each 
species for the swamp tree data are presented in Figure 7. The average NN distance in the swamp tree data 
is 3.08 ± 1.70 m (meamtstandard deviation). So Ripley's L-function is reliable for up to about 3 meters, 
where we see that all trees combined do not significantly deviate from CSR, however, black gums seem to 
be significantly aggregated for distances about [2, 3] meters and bald cypresses are significantly aggregated 





NN 








NN 






D.F. 


P.P. 


sum 




D.F. P.P. 




base 


137 


23 


160 


D.F. 


86 % 15 % 


70 % 


38 


30 


68 


P.P. 


56 % 44 % 


30 % 


sum 


175 


53 


228 


77 % 23 % 


100 % 



Table 10: The NNCT for Pielou's data (left) and the corresponding percentages (right). D.F. = Douglas-fir, 
P.P.= ponderosa pine. 



Test statistics and the associated p- values for Pielou's data 





^11 






^22 


Zp 




mc 


mc 


Zi 


Zu 


test statistics 


4.36 


2.29 


3.63 


3.61 


4.86 


3.81 


3.69 


3.86 


3.92 


3.62 


two-sided 


< .0001 


.0221 


.0003 


.0003 


< .0001 


.0001 






.0001 


.0003 


segregation 


< .0001 


.0110 


.0001 


.0002 


< .0001 






.0001 


< .0001 


.0001 


association 


w 1.0 


.9890 


.9999 


.9998 


w 1.0 




.9999 




w 1.0 


.9999 



Table 11: Test statistics and the associated p- values based on asymptotic approximation for the two-sided 
and directional alternatives for Pielou's data. 



for distances about 3 meters. For other distances in [0,3] meters, the pattern is not significantly different 
from CSR. Hence, segregation of the species detected by the NNCT-tests might be due to different levels 
and types of aggregation of the species in the study region. The pair correlation function is more reliable for 
distances larger than 3 meters. Then we observe that all trees arc significantly aggregated for distances about 
[3,5] meters; black gums are significantly aggregated for about [3,7] and [9,11] meters; and bald cypresses 
are significantly aggregated for about [3, 9] and [10, 12.5] meters. For other distances in [3, 12.5] meters, the 
pattern is not significantly different from CSR. 

We also calculate Ripley's bivariate L-function Ly(t). By construction, Lij(t) is symmetric in i and j in 
theory, that is, Lij{t) = Lji{t) for all i, j. But in practice edge corrections will render it slightly asymmetric, 
i.e., Lij{t) 7^ Lji{t). The corresponding estimates are pretty close in our example, so only one bivariate plot 
is presented. Under CSR independence, we have Lij{t) — t ~ 0. If the bivariate pattern is segregation, then 
Lij{t) — t tends to be negative, if it is association then Lij{t) — t tends to be positive. See (Diggle (2003)) 
for more detail. The same definition of the pair correlation function can be applied to Ripley's bivariate K 
or L-functions as well. The benchmark value of Kij{t) = irt^ corresponds to g{t) = 1; g{t) < 1 suggests 
segregation of the species; and g{t) > 1 suggests association of the species. 

Ripley's bivariate L-function and the bivariate pair correlation function for the species in swamp tree data 
are plotted in Figure 8. For 0-3 meter distances, Ripley's bivariate L-function suggests that the tree species 
are significantly segregated for distances about 0.5 and [1.8,3] meters, and do not significantly deviate from 
CSR for other distances. For 3-12.5 meters, the pair correlation function suggests that the tree species are 
significantly segregated for distances about [5, 7] and 9 meters, and do not significantly deviate from CSR for 
other distances. 

Considering Figures 7 and 8, we observe that Ripley's L and pair correlation functions usually detect the 
same large-scale pattern but at different ranges of distance values. Ripley's L suggests that the particular 
pattern is significant for a wider range of distance values compared to g(t), since values of L at small scales 
confound the values of L at larger scales where g{t) is more reliable to use (Wiegand and Moloney (2004) and 
Loosmore and Ford (2006)). 

7.3 Pyramidal Neuron Data 

This data set consists of the (x, y)-coordinates of pyramidal neurons in area 24, layer 2 of the cingulate 
cortex. The data are taken from a unit square region (unit of measurement unknown) in each of 31 subjects, 
grouped as follows: controls consists of 12 subjects and correspond to cell numbers 1-655, schizoaffectives 
consists of 9 subjects and correspond cell numbers 656-1061, and schizophrenics consists of 10 subjects and 
correspond cell numbers 1062-1400. Controls are the subjects with no previous history of any mental disorder, 
schizoaffective disorder is a psychiatric disorder where both the symptoms of mood disorder and psychosis 



NN 



NN 



B.G. B.C. 


sum B.G. B.G. 




, B.G. 
^^^^ B.G. 


149 33 
43 48 


182 B.G. 
91 B.G. 


82 % 18 % 
47 % 53 % 


67 % 
23 % 


sum 


192 581 


273 


34 % 66 % 


100 % 



Table 12: The NNCT for swamp tree data and the corresponding percentages (in parenthesis) 
gum trees, B.G. = bald cypress trees. 



B.G. = black 



Test statistics and the associated p-values for the swamp tree data 




^11 


^22 


^11 


^22 


Zp 




7°- 


rac 


Zi 


Zu 


test statistics 


4.47 


3.54 


4.62 


4.61 


5.90 


4.62 


4.48 


4.67 


4.76 


4.61 


against the two-sided alternative 


Pasy 


< .0001 


.0004 


< .0001 


< .0001 


< .0001 


< .0001 






< .0001 


< .0001 


Pmc 


< .0001 


.0003 


< .0001 


< .0001 










< .0001 


< .0001 


Prand 


< .0001 


.0004 


< .0001 


< .0001 


< .0001 


< .0001 






< .0001 


< .0001 


against the right-sided (i.e., segregation) alternative 


Pasy 


< .0001 


.0002 


< .0001 


< .0001 


< .0001 






< .0001 


< .0001 


< .0001 


Pmc 


< .0001 


.0002 


< .0001 


< .0001 










< .0001 


< .0001 


Prand 


< .0001 


.0003 


< .0001 


< .0001 










< .0001 


< .0001 


against the Icft-sidcd (i.e., association) alternative 


Pasy 


« 1.0 


.9998 


w 1.0 


w 1.0 


« 1.0 




« 1.0 




w 1.0 


w 1.0 


Pmc 


w 1.0 


.9998 


w 1.0 


w 1.0 










w 1.0 


w 1.0 


Prand 


« 1.0 


.9998 


« 1.0 


w 1.0 










w 1.0 


w 1.0 



Table 13: Test statistics and the associated p-values for the two-sided and directional alternatives for the 
swamp tree data. Pasy, Pmc, and Prand stand for the p- values based on the asymptotic approximation, Monte 
Garlo simulation, and randomization of the tests, respectively. 



occur, and schizophrenia is a psychotic disorder characterized by severely impaired thinking, emotions, and 
behavior. Diggle et al. (1991) applied several methods for the analysis of the spatial distributions of pyramidal 
neurons in the cingulatc cortex of human subjects in three diagnostic groupings. With a scaled Poisson 
analysis they found significant differences between the groups in the mean numbers of neurons in the sampled 
region, as well as a high degree of extra-Poisson variation in the distribution of cell counts within these 
groups. They employed two different functional descriptors of spatial pattern for each subject to investigate 
departures from completely random patterns, both between subjects and between groups, while adjusting for 
cell count differences. Since the distributions of their main functional pattern descriptor and of their derived 
test statistic are unknown, they applied a bootstrap procedure to attach p-values to their findings. 

Since the definition of the rectangular domain for identifying neuron positions is independent of neuronal 
cell density or the pattern and this sampling domain is almost identical for each subject, Diggle et al. (1991) 
merged (i.e., pooled) the data for each group. That is, the pyramidal neuron locations from control subjects 
were pooled into one group, from schizoaffective subjects into another, and schizophrenic subjects into another. 
Although, the spatial distributions between subjects are not the same, we think pooling the data by group 
might reveal more than what might be concealed. Diggle et al. (1991) computed and compared Ripley's 
univariate functions to detect differences between patterns across the three groups. Pattern analysis of 
the cellular arrangements demonstrated significant deviation from GSR in favor of spatial regularity for each 
group. On the pooled data, Geyhan (2009) applied a 3 x 3 NNCT-analysis and Ripley's L-functions and 
found that deviation is toward association of controls with schizoaffectives and vice versa. In this article we 
will only consider the pyramidal neurons of controls and schizoaffectives. 

We plot the locations of these points in the study region in Figure 9 and provide the corresponding NNGT 
together with percentages based on row and column sums in Table 14. Observe that the pyramidal neuron 
locations appear to be somewhat regularly spaced. Also the percentages are slightly smaller for the diagonal 
cells, compared to the marginal (row or column) percentages, which might be interpreted as presence of a 
deviation from GSR independence in favor of association. 




Figure 8: Ripley's bivariate L-function Li2{t) — t (left) and pair correlation function g{t) (right) for the swamp 
tree data. Wide dashed lines arc the upper and lower (pointwise) 95 % confidence bounds for the functions 
based on Monte Carlo simulations under the CSR independence pattern. B.G. = black gums and B.C. = 
bald cypresses. 
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Figure 9: The scatter plots of the locations of neurons of controls (circles o), and schizoaffectives (triangles 
A) in the pyramidal neuron data. 



NN 
Ctrl S.A. 


sum 




NN 
Ctrl S.A. 




, Ctrl 
base „ . 
S.A. 


368 288 
273 136 


656 
409 


Ctrl 
S.A. 


56 % 44 % 
67 % 33 % 


62 % 
38 % 


sum 


641 424 


1065 


60 % 40% 


100 % 



Table 14: The NNCT for the pyramidal neuron data (left) and the corresponding percentages (right). Ctrl 
= Control and S.A. = Schizoaffective. 

The locations of the pyramidal neurons can be viewed a priori resulting from different processes, so the 
more appropriate null hypothesis is the CSR independence pattern. We calculate Q = 668 and R = 668 
for this data set. In Table 15, the cell-specific and the directional test statistics and the associated nd the 
associated values. Observe that Pasy, Pmc, and Prand values are similar for each test. The tests are significant 
except for at .05 level, implying significant deviation from CSR independence. Based on the directional 
tests, this deviation is toward association of controls with schizoaffectives and vice versa, since all tests are 
significant against the left-sided alternative. 



Test statistics and the associated p-velues for 


die pyramidal neuron data 




^11 


^22 


^11 


Z22 


Zp 


Zrac 




mc 


Zi 


Zii 


test statistics 


-2.86 


-1.90 


-2.70 


-2.70 


-3.45 


-2.70 


-2.68 


-2.66 


-2.68 


-2.70 


against the two-sided ahernative 


Pasy 


.0042 


.0575 


.0069 


.0069 


.0006 


.0068 






.0073 


.0069 


Pmc 


???.0042 


.0575 


.0003 


.0003 


.0006 


.0068 






.0073 


.0069 


Pvand 


.0048 


.0633 


.0080 


.0062 










.0064 


.0063 


against the right-sided (i.e., segregation) ahernative 


Pasy 


.9979 


.9713 


.9965 


.9965 


.9997 






.9961 


.9964 


.9965 


Pmc 


???.9979 


.9713 


.0001 


.0002 


.9997 






.9961 


.9964 


.9965 


Pvand 


.9979 


.9613 


.9975 


.9975 










.9975 


.9975 


against the left-sided (i.e., association) ahernative 


Pasy 


.0021 


.0287 


.0035 


.0035 


.0003 




.0037 




.0036 


.0035 


Pmc 


???.9979 


.9713 


.0001 


.0002 


.9997 






.9961 


.9964 


.9965 


Pvand 


.0021 


.0397 


.0025 


.0025 










.0025 


.0025 



Table 15: Test statistics and the associated p- values for the two-sided and directional alternatives for the 
pyramidal neuron data. The labeling of the p-values arc as in Table 13. 

To find out what might be causing the association, and what is the type and level of interaction at different 
scales we plot Ripley's (univariate) L-function and pair correlation function for all data combined and for 
each (pooled) group in Figure 10 where the upper and lower 95 % confidence bounds are also provided. 
The average NN distance for this data set is 0.0155 (± 0.0086)), so we only consider distances up to 0.02 
for Ripley's X-function. For this range of distances Ripley's univariate L-function suggests no significant 
deviation from CSR pattern for all data combined and schizoaffectives, but it indicates significant regularity 
for controls at t « .01. For t G [.02, .25], the pair correlation function suggests that all neurons are significantly 
aggregated for distances about .04, .08, .09, .15, .20, and .22; control neurons are significantly aggregated 
at about .08, .10, and .25; and schizoaffectives are significantly aggregated at about .05, .07, .13 and .18. 
At other distances, the neurons do not significantly deviate from CSR. This is along the lines of the NNCT 
analysis results, which indicate deviation from CSR independence at smaller scales. The significant spatial 
regularity of the controls might explain the association of neurons of controls and schizoaffectives. 

We also plot Ripley's bivariate L-function and pair correlation function together with the upper and lower 
95 % confidence bounds in Figure 11. For distances up to 0.02 for Ripley's bivariate L-function suggests that 
control and schizoaffective neurons are significantly associated with each other. For t € [.02, .25], the pair 
correlation function suggests that control and schizoaffective neurons are significantly associated at about .04 
only. At other distances, the neurons do not significantly deviate from CSR independence. So at smaller 
scales (i.e., t < 0.02) the univariate and bivariate L-functions seem to be in agreement with the NNCT results 
which indicate the association of controls and schizoaffectives. 
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Figure 10: Ripley's univariate L-functions (top row) Lii{t) — t for i = 0, 1, 2, where i = stands for all data 
combined, i — \ for controls and i ~ 2 for schizoafFectives; and pair correlation functions g{t) for all neurons 
combined and for each class (bottom row). Wide dashed lines are the upper and lower (pointwise) 95 % 
confidence bounds for the functions based on Monte Carlo simulation under the CSR independence pattern. 
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Figure 11: Ripley's bivariate L-function Li2{t) — t (left) and pair correlation function g{t) (right) for the 
pyramidal neuron data. Wide dashed lines are the upper and lower (pointwise) 95 % confidence bounds for 
the functions based on Monte Carlo simulations under the CSR independence pattern. Ctrl = Control and 
S.A. = Schizoaffective. 



7.4 Artificial Data 



In this section, we provide an artificial example, a random sample of size 100 (with 50 X-points and 50 
K-points uniformly generated on the miit square). The question of interest is the spatial interaction between 
X and Y classes. We plot the locations of these points in the study region in Figure 12 and the corresponding 
NNCT together with percentages are provided in Table 16. Observe that the percentages are slightly larger 
for the diagonal cells, which might be interpreted as presence of mild (not necessarily significant) segregation 
for both classes. 

NN NN 



X Y 


sum X Y 




base ^ 


29 21 
20 30 


50 X 
50 Y 


58 % 42 % 
40 % 60 % 


50 % 
50 % 


sum 


49 51 


100 


49 % 51% 


100 % 



Table 16: The NNCT for the artificial data and the corresponding percentages (in parenthesis). 




Figure 12: The scatter plots of the locations of X points (circles o) and Y points (triangles A). 

Observe that in Table 17, p- value for the two-sided alternative of the directional test Zp is almost significant 
{p = .0718), and for the right-sided alternative i.e. for segregation it is significant {p = .0359), which might be 
interpreted as evidence of deviation from CSR independence, but given the reservations on the appropriateness 
for testing CSR independence or RL, other tests arc more reliable, and they all give insignificant p-values. 
Furthermore, the plot in Figure 12 is not suggestive of any deviation from CSR independence, and the 
dependence between cell counts confounds the conclusion based on Zp. 

We also plot Ripley's (univariate) L-function and pair correlation function for all data combined and for 
each group in Figure 13. The average NN distance for this data set is 0.05 (± 0.03)), so we only consider 
distances up to 0.05 for Ripley's L-function. For this range of distances Ripley's univariate L-function suggests 
no significant deviation from CSR pattern for each plot. For t G [.05, .25], the pair correlation function 
suggests that all data points do not significantly deviate from CSR; neither do the Y points; but X points are 
significantly aggregated at about .05 and [.22, .24] and at other distances X points do not significantly deviate 
from CSR. This is along the lines of the NNCT analysis results, which indicate no significant deviation from 
CSR independence at smaller scales. 

We also plot Ripley's bivariate L-function and pair correlation function in Figure 14. For distances up 
to 0.05 for Ripley's bivariate L-function suggests that X and Y points are significantly segregated at about 
t = .04. For t € [.05, .25], the pair correlation function suggests that X and Y points are significantly 
segregated at about .14 and .20 only. At other distances, the points do not significantly deviate from CSR 
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Figure 13: Ripley's univariate L-functions (top row) Lii{t) — t for i = 0, 1, 2, where i = stands for all data 
combined, i = 1 for X points and i = 2 for Y points; and pair correlation functions g(t) for all data combined 
and for each class (bottom row). Wide dashed lines are the upper and lower (pointwise) 95 % confidence 
bounds for the functions based on Monte Carlo simulation under the CSR independence pattern. 



Test statistics and the associated p- values for the artificial data 
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test statistics 


1.13 


1.40 


1.49 


1.49 


1.80 


1.41 


1.34 


1.46 


1.49 


1.49 


against the two-sided alternative 


Pasy 


.2570 


.1615 


.1365 


.1356 


.0718 


.1586 






.1360 


.1360 


Pmc 


.2593 


.1614 


.1404 


.1371 










.1414 


.1380 


Prand 


.2842 


.1822 


.1406 


.1431 










.1464 


.1368 


against the right-sided (i.e., segregation) alternative 


Pasy 


.1285 


.0808 


.0682 


.0678 


.0359 






.0726 


.0680 


.0680 


Pmc 


.1323 


.0801 


.0716 


.0701 










.0722 


.0717 


Prand 


.1578 


.1002 


.0723 


.0671 










.0787 


.0787 


against the left-sided (i.e., association) alternative 


Pasy 


.8715 


.9192 


.9318 


.9322 


.9641 




.9106 




.9320 


.9320 


Pmc 


.8695 


.9212 


.9287 


.9302 










.9281 


.9286 


Prand 


.9013 


.9389 


.9316 


.9368 










.9252 


.9252 



Table 17: Test statistics and the associated p-values for the two-sided and directional alternatives for the 
artificial data. The labeling of the p-values are as in Table 13. 

independence. So at smaller scales (i.e., t < 0.02) the univariate and bivariate L-functions seem to be in 
agreement with the NNCT results which indicate no deviation from CSR independence. 
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Figure 14: Ripley's bivariate L-function Li2(i) — i (left) and pair correlation function g(t) (right) for the 
artificial data. Wide dashed lines are the upper and lower (pointwise) 95 % confidence bounds for the 
functions based on Monte Carlo simulations under the CSR independence pattern. Ctrl = Control and S.A. 
= Schizoaffective. 



8 Discussion and Conclusions 

Pielou's and Dixon's segregation tests based on nearest neighbor contingency tables (NNCTs) are x^-tests, 
hence are used for two-sided alternatives. That is, when the null patterns of CSR independence or RL are 
rejected, these tests do not indicate the direction of the alternative pattern, which consist of segregation 
or association patterns. In this article, we discuss directional (i.e., one-sided) tests of segregation based on 
NNCTs. We propose a directional version of Pielou's test by partitioning the test statistic in the usual 
fashion (Bickel and Doksum (1977)). However, the problem that confounds Pielou's test (i.e., the problem 
of dependence between cell counts) is inherited by the directional versions also. This makes the directional 
version of Pielou's test liberal in rejecting the null case of CSR independence or RL. We also consider the 
directional versions of the cell-specific tests due to Dixon (1994) and Ceyhan (2008c). Furthermore, we 
introduce two new directional tests of segregation. 

We discuss the differences in these NNCT-tests, compare the tests using extensive Monte Carlo simu- 
lations under RL and CSR independence and under various segregation and association alternatives. We 



also illustrate the tests on four examples and compare them with Ripley's i-function (Ripley (2004)). We 
demonstrate that under the CSR independence pattern, NNCT-tests are conditional on Q and i?, but not 
under RL. 

Based on our Monte Carlo simulations, we conclude that the asymptotic approximation for the ccU-spccific 
and the directional tests is appropriate only when the corresponding cell count in the NNCT is larger than 
10. When a cell count is less than 10, we recommend the Monte Carlo randomization of these tests. Type 
I error rates (empirical significance levels) of Ceyhan's cell-specific and of the new directional tests are more 
robust to the differences in sample sizes (i.e., differences in relative abundances). Considering the empirical 
significance levels and power estimates of the tests, we recommend version II of the new tests (defined in 
Equation (12)) for the two-sided alternatives provided the sample sizes are not very different. 

The CSR independence pattern assumes that the study region is unbounded for the analyzed pattern, 
which is not the case in practice. Edge effects are a constant problem in the analysis of empirical (i.e., 
bounded) data sets and much effort has gone into the development of edge correction methods (Yamada and 
Rogersen (2003)). So the edge (or boundary) effects might confound the test results if the null pattern is the 
CSR independence. Two correction methods for the edge effects on NNCT-tests, namely buffer zone correction 
and toroidal correction, are investigated in (Ceyhan (2007), Ccyhan (2008b), and Ceyhan (2008a)) where it 
is recommended that inner or outer buffer zone correction for NNCT-tests could be used with the width of 
the buffer area being about the average NN distance. But larger buffer areas are not recommended since they 
are wasteful with little additional gain. On the other hand, toroidal edge correction is recommended with 
points within the average NN distance in the additional copies around the study region. For larger distances, 
the gain might not be worth the effort. We extend these recommendations for the new directional tests also. 

NNCT-tests summarize the pattern in the data set for small scales, more specifically, they provide infor- 
mation on the pattern around the average NN distance between all points. On the other hand, pair correlation 
function g(t) and Ripley's classical K or L-functions and other variants provide information on the pattern 
at various scales. However, the classical L-function is not appropriate for the null pattern of RL when loca- 
tions of the points have spatial inhomogeneity. For such cases. Biggie's D-function (Diggle (2003) p. 131) 
is more appropriate in testing the bivariate spatial clustering at various scales. Our example illustrates that 
for distances around the average NN distance, NNCT-tests and Ripley's bivariate L-function yield similar 
results. 

If significant, the cell-specific test and the new tests (for the two-sided alternative) imply significant 
deviation from the null pattern. Furthermore, the sign of the test statistic will be suggestive of segregation (if 
positive) and association (if negative). But these tests are more powerful against the one-sided alternatives. 
For a data set for which CSR independence is the reasonable null pattern, we recommend the NNCT-tests 
if the question of interest is the spatial interaction at small scales (i.e., about the mean NN distance). One 
can also perform Ripley's K or i-function and only consider distances up to around the average NN distance 
and compare the results with those of NNCT analysis. If the spatial interaction at higher scales is of interest, 
pair correlation function is recommended (Loosmore and Ford (2006)). On the other hand, if the RL pattern 
is the reasonable null pattern for the data, we recommend the NNCT-tests if the small-scale interaction is of 
interest and Biggie's _D-function if the spatial interaction at higher scales is also of interest. 
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